Concepedia

Concept

speech enhancement

Parents

Children

5.1K

Publications

267.4K

Citations

7.5K

Authors

1.7K

Institutions

Probabilistic Model-Based Speech Enhancement

1991 - 1997

The period foregrounded a probabilistic, model-based paradigm that explicitly represents speech and noise statistics, enabling Bayesian MMSE/MAP estimators and hidden Markov model–driven decompositions to separate speech from interference. Iterative, constrained spectral-domain optimization emerged, employing EM-like updates and spectral constraints guided by perceptual considerations to refine speech spectra under noisy conditions. Auditory-inspired cues and perceptual strategies became central to preserving intelligibility, while robust feature extraction and perceptual quality assessment anchored evaluation and benchmarking across diverse acoustics. The convergent focus on robust recognition and speech restoration under realistic channels unified methodological threads across enhancement and recognition research.

Model-based probabilistic frameworks unify speech enhancement with recognition by explicitly modeling speech and noise statistics, enabling Bayesian MMSE/MAP estimators, HMM decompositions, and state-dependent dynamics to separate speech from interference [2], [10], [3], [11], [19].

Iterative, constrained spectral-domain optimization employs EM-like iterations and explicit spectral constraints, often guided by auditory constraints, to iteratively refine speech spectra under noisy conditions [1], [17], [13].

Auditory-inspired and perceptual strategies use auditory evidence, Lombard-effect considerations, and morphological spectral constraints to preserve intelligibility and robustness under varying acoustic environments [5], [4], [9].

Feature-level robustness for recognition in noise emphasizes robust spectral estimation and transform-based features, using energy-conditioned estimates, subband/envelope processing, and reliable filterbank representations to improve recognition [14], [12], [19], [18].

Quality assessment and perceptual evaluation align objective measures with subjective judgments, guiding development and benchmarking of enhanced and coded speech [8], [20].

Data-Driven Speech Enhancement

1998 - 2016

Unified Time-Phase Speech Enhancement

2017 - 2023